Approximation of Grammar-Based Compression via Recompression

نویسنده

  • Artur Jez
چکیده

In this paper we present a simple linear-time algorithm constructing a context-free grammar of size O(g log(N/g)) for the input string, where N is the size of the input string and g the size of the optimal grammar generating this string. The algorithm works for arbitrary size alphabets, but the running time is linear assuming that the alphabet Σ of the input string can be identified with numbers from {1, . . . , N} for some constant c. Otherwise, additional cost of O(n log |Σ|) is needed. Algorithms with such an approximation guarantee and running time are known, the novelty of this paper is a particular simplicity of the algorithm as well as the analysis of the algorithm, which uses a general technique of recompression recently introduced by the author. Furthermore, contrary to the previous results, this work does not use the LZ representation of the input string in the construction, nor in the analysis.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approximation of smallest linear tree grammar

A simple linear-time algorithm for constructing a linear context-free tree grammar of size O(rg + rg log(n/rg)) for a given input tree T of size n is presented, where g is the size of a minimal linear context-free tree grammar for T , and r is the maximal rank of symbols in T (which is a constant in many applications). This is the first example of a grammar-based tree compression algorithm with...

متن کامل

A Space-Saving Approximation Algorithm for Grammar-Based Compression

A space-efficient approximation algorithm for the grammar-based compression problem, which requests for a given string to find a smallest context-free grammar deriving the string, is presented. For the input length n and an optimum CFG size g, the algorithm consumes only O(g log g) space and O(n log∗n) time to achieve O((log∗n) logn) approximation ratio to the optimum compression, where log∗n i...

متن کامل

Bone cement distribution in the vertebral body affects chances of recompression after percutaneous vertebroplasty treatment in elderly patients with osteoporotic vertebral compression fractures

OBJECTIVE Percutaneous vertebroplasty (PVP) is a surgical procedure that has been widely used to treat patients suffering from osteoporotic vertebral compression fractures (OVCFs). The procedure involves injection of bone cement into a fractured vertebra. In this study, we investigated whether the distribution of the cement in the vertebral body is related to the occurrence of recompression aft...

متن کامل

Application of Lempel-Ziv Factorization to the Approximation of Grammar-Based Compression

We present almost linear time (O(n · log |Σ|) time) O(log n)ratio approximation of minimal grammar-based compression of a given string of length n over an alphabet Σ and O(k · log n) time transformation of LZ77 encoding of size k into a grammar-based encoding of size O(k · log n). Computing exact size of the minimal grammar-based compression is known to be NP -complete. The basic novel tool is ...

متن کامل

Approximation algorithms for grammar-based data compression

This thesis considers the smallest grammar problem: find the smallest context-free grammar that generates exactly one given string. We show that this problem is intractable, and so our objective is to find approximation algorithms. This simple question is connected to many areas of research. Most importantly, there is a link to data compression; instead of storing a long string, one can store a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Theor. Comput. Sci.

دوره 592  شماره 

صفحات  -

تاریخ انتشار 2013